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ABSTRACT 

The International Union of Basic and Clinical 
Pharmacology (IUPHAR) database, IUPHAR-DB 
(http://www.iuphar-db.org) is an open access, 
online database providing detailed, expert-driven 
annotation of the primary literature on human and 
rodent receptors and other drug targets, together 
with the substances that act on them. The present 
release includes information on the products of 
646 genes from four major protein classes 
(G protein-coupled receptors, nuclear hormone 
receptors, voltage- and ligand-gated ion channels) 
and ~3180 bioactive molecules (endogenous 
ligands, licensed drugs and key pharmacological 
tools) that interact with them. We have described 
previously the classification and curation of data 
for small molecule ligands in the database; in this 
update we have annotated 366 endogenous peptide 
ligands with their amino acid sequences, post- 
translational modifications, links to precursor 
genes, species differences and relationships with 
other molecules in the database (e.g. those 
derived from the same precursor). We have also 
matched targets with their endogenous ligands 
(peptides and small molecules), with particular at- 
tention paid to identifying bioactive peptide ligands 
generated by post-translational modification of 
precursor proteins. Other improvements to the data- 
base include enhanced information on the clinical 
relevance of targets and ligands in the database, 
more extensive links to other databases and a 



pilot project for the curation of enzymes as drug 
targets. 

INTRODUCTION 

The International Union of Basic and Clinical 
Pharmacology (IUPHAR) database, IUPHAR-DB 
(http://www.iuphar-db.org/), is now in its 10th year 
(1,2), and provides expert-driven, manually curated anno- 
tation of the pharmacological, physiological and genetic 
properties of human and rodent receptors and other drug 
targets, together with the substances that act on them. 
Developed under the auspices of the IUPHAR 
Committee on Receptor Nomenclature and Drug 
Classification (NC-IUPHAR), IUPHAR-DB is an au- 
thoritative reference and educational resource for pharma- 
cologists, clinicians and allied disciplines, receiving over 
7000 unique visitors from 115 countries monthly. Here, 
we review enhancements to the data content and website 
since the last published update (2), including the addition 
of information on endogenous ligands of receptors, anno- 
tation of structural data for endogenous peptide ligands, a 
pilot project to curate enzymes as drug targets, increased 
database links and expansion of data on the clinical rele- 
vance of targets and ligands in the database. 

CONTENT 

In its present release, IUPHAR-DB covers the complete 
genomic complement of 4 protein superfamilies in human, 
rat and mouse: non-sensory G protein-coupled receptors 
(GPCRs), nuclear hormone receptors (NHRs), voltage- 
gated ion channels (VGICs) and ligand-gated ion channels 
(LGICs), as well as pilot data on 10 enzymes (from the 
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lanosterol biosynthesis pathway). All data are manually 
curated by expert scientists and curators and referenced to 
primary literature citations in PubMed (http://www.ncbi 
.nlm.nih.gov/pubmed). Information provided about 
targets includes their nomenclature, structure, function, 
physiology, clinical relevance and key ligands (endogenous 
ligands, licensed drugs, pharmacological tools and radio- 
ligands). Targets are linked to genomic databases such as 
Entrez Gene (3), Ensembl (4) and OMIM (Online 
Mendelian Inheritance in Man) (5), protein databases 
such as UniProt (6) and specialist pharmacological and me- 
dicinal chemistry databases such as DrugBank (7) and 
ChEMBL (8). The database presently includes information 
on over 8000 interactions between these targets and ~3 1 80 
small molecules, natural products, peptides and inorganic 
compounds. Database pages for ligands provide informa- 
tion on their structures, physicochemical properties, 
synonyms, binding affinities and actions at targets, lists of 
similar substances and links to chemistry databases 
including PubChem (9), ChemSpider (10) and Chemical 
Entities of Biological Interest (ChEBI) (11). All data are 
subject to yearly review and updated as required to reflect 
advances in the field. Table 1 lists the receptor and ion 
channel database pages that have been updated since the 
last published report (2). 

NEW FEATURES 

Annotation of endogenous ligands 

IUPHAR-DB target pages list ligands recommended for 
experimental use including endogenous ligands, synthetic 



drugs and radiolabeled forms, together with their actions 
and quantitative data. Since the lists can be quite exten- 
sive, NC-IUPHAR is working to identify a subset of 
ligands with optimal properties for preclinical and trans- 
lational research. A crucial first step is to identify the prin- 
cipal endogenous ligands with activity at targets. To this 
end IUPHAR-DB curators have identified through litera- 
ture and database searches 863 endogenous ligands for 
GPCRs and NHRs, and highlighted these on target 
pages together with additional comments. Likewise, the 
database pages for endogenous ligands clearly identify 
their natural targets under the 'Biological Activity' tab. 

Linking endogenous small molecule ligands such as 
monoamines to their cognate receptors was straightfor- 
ward because most were already described in IUPHAR- 
DB. However, the task proved more challenging for 
peptides due to the difficulty in distinguishing between 
endogenous peptides synthesized by post-translational 
processing of polypeptide precursors and peptide frag- 
ments produced synthetically in order to study struc- 
ture-activity relationships. It can be very difficult to 
prove the existence, or otherwise, of particular peptide 
fragments in vivo, many of which may be present at only 
low concentrations, in distinct tissues or under specific 
physiological conditions. As a result there is a disjunction 
between sequences listed in online protein databases and 
information in the literature on their activities at targets. 
We sought to bridge this divide by annotating endogenous 
peptides described in pharmacological literature with 
structural information and by defining their relationships 
with other peptides. Endogenous ligands were identified 



Table 1. IUPHAR-DB receptor and ion channel pages that have been updated since the last published database report 



Receptor family 


Details 


Date of public release 


Dopamine 


Annotation added 


August 2010 


Formylpeptide 


Annotation added 


August 2010 


Leukotriene 


Annotation added 


August 2010 


Motilin 


Annotation added 


August 2010 


Tachykinin 


Annotation added 


August 2010 


Vasopressin and oxytocin 


Annotation added 


August 2010 


Hydrocarboxylic acid 


Annotation added 


February 2011 


Lysophospholipid 


Annotation added 


February 2011 


Frizzled 


Annotation added (FZD3 and FZD4) 


February 2011 


Orphan GPCRs 


Annotation added (GPR39, LPAR6, GPR35, GPR87 and P2RY10) 


February 2011 


Ionotropic glutamate 


Annotation added (GluA2) 


February 2011 


GABAa 


Annotation added (5,E,6,n) 


February 2011 


P2X 


Annotation added (P2X7) 


February 2011 


P2X 


Annotation added (P2X3) 


May 2011 


GABA A 


Annotation added (P2.y3) 


May 2011 


Nicotinic acetylcholine 


Annotation of all subunits 


September 201 1 


VIP and PACAP 


Updated 


September 2011 


cc-adrenoceptors 


Updated 


February 2012 


P-adrenoceptors 


Updated 


February 2012 


Calcitonin 


Updated 


February 2012 


Calcium sensing 


Updated 


February 2012 


Cannabinoid 


Updated 


February 2012 


Orexin 


Updated 


February 2012 


GABAa 


Updated (tn3,oi4,ot5,p3,yl,y2,8,E,e) 


February 2012 


Neuropeptide FF/neuropeptide AF 


Updated 


July 2012 


Neurotensin 


Updated 


July 2012 


Vasopressin and oxytocin 


Updated 


July 2012 


P2X 


Updated (P2X3) 


July 2012 
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LIgand Id: 1697 



Ugand name orexin-A 



Summary | [| \j [. \] 



Compound class 



Endogenous peptide in human, r 



Protein name 



Previous gene n 



Database Links O 

CAS Registry No. 

UnlProt 043612 (Hs), 055241 (Mm), 055232 (Rn) 



20S640-90-0 



y refer t 



For m 



please contact: c 



B 



LIgand Id: 3732 



LIgand name LH p subunit 



Summary || Biological activity || Refere 



Similar ligands | 



Ser-Arg-Glu-Pro-Leu-Arg-Pro-Trp-Cys-Hls-Pro-Ile-Asn-Ala-Ile-Leu-Ala-Val-Glu-Lys-Glu-Gly-Cys-Pro-Val-Cys-Ile-Thr- 

Val-Asn-Thr-Thr-Ile-Cys-Ala-Gly-Tyr-Cys-Pro-T^ 

val-Val-Cys-Thr-Tyr-Arg-Asp-Val-Arg-Phe-Glu-Ser-Ile-Arg-l^ 

Ser-Phe-Pro-Val-Ala-Leu-Ser-Cys-Arg-Cys-Gly-Pro-Cys-Arg-Arg-Ser-Thr-Ser-Asp-Cys-Gly-Gly-Pro-Lys-Asp-Hls- 
Pro - Le u -Th r-Cy s - Asp- H Is- Pro -Gin- Leu - Ser-G ly- Leu - Leu - Phe - Le u 



Search for 3D structures on the PDB 

Search by keyword: LH p subunit 



Search by accession number: P01229 



Post-translation a I Modification 

N-linked glycosylatlon at residue 50; disulphide bonds between residues 29 and 77, 43 and 92, 46 and 130, 54 and 
108, 58 and 110 and 113 and 120. The production of biologically active LH requires appropriately folded and 
glycosylated LUG and common o subunlts that assemble to form the active LH heterodlmer [1]. 



Download 2D Structure O 

No information available. 



Ligand Id: 1152 


Ligand name 


VIP 


Species 


Human, Mouse, Rat 



^Tii ..,.„,„<., iri^ 



Similar ligands | 



For advanced searching click here to open chemical structure editor 



PHI {Sp: Pig} 


IUPHAR-DB targets: VPAC,, VPAC 2 


PHM {Sp: Human} 


iuphar-db targets: pac,, VPAC t 


PHV {Sp: Rat} 


IUPHAR-DB targets: pac,, VPAC,, VPAC Z 


PHV {Sp: Human} 


IUPHAR-DB targets: pac. 


PHI {Sp: Mouse, Rat} 


IUPHAR-DB targets: PAC,, VPAC^ VPAC Z 




Other Similar Sequences © 


N-stearyl-[Nle 17 ] neurotensin(6-ll)/VIP(7-28) 


IUPHAR-DB targets: PAC,, VPAC^ VPAC 2 


BAY SS-9837 


IUPHAR-DB targets: PAC,, VPAC,, VPAC a 


PG 99-465 


IUPHAR-DB targets: pac,, VPAC,, VPAC 2 


['"l]VIP 


IUPHAR-DB targets: pac,, vpac,, VPAC S 
tiKAi targets: vpac,, vpac 2 


LAra"'"' !B ]VIP 


IUPHAR-DB targets: PAC,, VPAC,, VPAC 2 
GRAC targets: VPAC, 



Figure 1. Panel (A) shows a screenshot of the ligand page for the en- 
dogenous peptide orexin-A open at the 'Summary' tab. The sequence of 
orexin-A is identical in human, mouse and rat so the information on 
this peptide is grouped into a single database entry. Panel (B) shows a 
screenshot of the LH (3 subunit ligand page, open at the 'Structure' tab 
and showing the peptide sequence, post-translational modifications and 
a link to the ligand page for the LH heterodimer. Panel (C) shows a 
screenshot of the ligand page for vasoactive intestinal peptide (VIP), 



through detailed reviews of the literature and interroga- 
tion of sequence databases such as UniProt. In total, 366 
distinct endogenous peptide sequences were identified as 
ligands of human, rat and mouse GPCRs. 

Reorganized ligand classification 

The IUPHAR-DB ligand classification has been 
reorganized. The 'Peptide' category is now divided into 
'Endogenous Peptides' and 'Other Peptides'. For this 
purpose, endogenous peptides were defined as peptides 
encoded in the genomes of the three species covered by 
IUPHAR-DB, human, rat and mouse. All other peptides 
including synthetic peptides, semi-synthetic modified 
forms and peptides endogenous to other species (e.g. 
conotoxins) are now classified as 'Other Peptides'. 
Endogenous small molecules such as hormones, neuro- 
transmitters and metabolites have been placed into a 
category named generally as 'Metabolites' (formerly 
'Small Organics'), which also includes very close ana- 
logues such as radiolabeled forms. All other synthetic 
compounds may be found under the 'Synthetic Organics' 
category. 

Enhanced peptide information 

In the last published update (2) we described the curation 
of structural information and descriptors for ~2000 small 
molecule ligands in IUPHAR-DB. This information has 
now been enhanced by the curation of structural data for 
the 366 human, rat and mouse endogenous peptide se- 
quences described earlier. For each endogenous peptide 
ligand, a range of information has been collected, 
including nomenclature, single- and three-letter amino 
acid sequences, post-translational modifications, 
encoding genes and precursor proteins for human, rat 
and mouse. Where the mature peptide was identical in 
structure between species, the information on all the 
encoding genes and precursor proteins was collated 
under the same database entry (Figure 1A). Where the 
mature peptide differed in sequence between species, 
separate database entries where created for each species 
with the relevant structural, precursor and genetic infor- 
mation included. Database links are provided to genetic, 
protein and chemistry databases including the HUGO 
Gene Nomenclature Committee (HGNC) database (12), 
Rat Genome Database (RGD) (13), Mouse Genome 
Database (MGD) at the Mouse Genome Informatics 
(MGI) website (14), Ensembl, UniProt, PubChem, 
ChEBI, and, where available, to experimental 3D struc- 
tures in the RCSB Protein Data Bank (15). 

In some cases, IUPHAR-DB also provides information 
on endogenous peptides in species other than human, 
mouse and rat where these are commonly used as experi- 
mental tools at human or rodent targets. Ambiguities in 



Figure 1. Continued 

open at the 'Similar Ligands' tab. The 'Related Sequences' table lists 
other peptides derived from the same precursor or orthologues in other 
species, whereas the 'Other Similar Sequences' table lists other peptides 
that have similar sequences to VIP, including synthetic and modified 
forms of VIP. 
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the literature on the peptide hormone secretin illustrate 
the need for this work. Secretin was first isolated from 
the pig and porcine secretin is frequently used in assays 
of secretin receptor function, even though it differs in 
amino acid sequence from the endogenous human and 
rodent peptides (16). Re-evaluation of the literature on 
secretin in the course of this project enabled us to 
provide separate data on the activities of pig, human 
and rodent secretins. 

Some hormones are formed by the covalent association 
of multiple peptide fragments of a single polypeptide pre- 
cursor (e.g. the A and B chains of insulin (17) and related 
hormones). In other cases, polypeptide subunits encoded 
by more than one gene combine to form the active ligand; 
the exact combination can determine target specificity. 
For example, the glycoprotein hormones follicle- 
stimulating hormone (FSH), thyroid-stimulating 
hormone (TSH), luteinizing hormone (LH) and human 
chorionic gonadotropin (hCG) all share a common a 
subunit with different (3 subunits conferring their target 
specificity (18). The database now supports multi-subunit 
polypeptide ligands, with distinct entries created for the 
intact hormone and for each component polypeptide 
chain. Cross-links between functional ligand and compo- 
nent subunit pages are provided via the 'Structure' tab. 
Sequence and precursor information is provided under 
the subunit entry (e.g. LH (3 subunit, Figure IB) and bio- 
logical activity data can be found on the page for the 
functional ligand (e.g. LH). 

A new 'Related Sequences' table has been added to en- 
dogenous peptide pages, which lists other peptides in the 
database that have the same parent precursor polypeptide, 
as well as corresponding peptides from other species. The 
'Other Similar Sequences' table lists additional endogenous 
and synthetic peptides with similar sequences, identified by 
clustering with the h-cd-hit program, part of the CD-HIT 
Suite (19). Together, these tables allow users to easily 
identify similar peptides in the database without any 
prior knowledge of their structures (Figure 1C). 

Overall, this is a unique resource bringing together en- 
dogenous peptide sequences with quantitative data on 
their actions at targets, which complements detailed struc- 
tural data available in UniProt, and pathway information 
available in the Kyoto Encyclopedia of Genes and 
Genomes (KEGG) (20). Work is currently under way to 
curate sequence and structural information on modified 
peptides containing non-natural amino acids, 
pseudo-peptide bonds, and/or cyclization and link these 
to activity data in IUPHAR-DB. 

Clinical relevance of drugs and targets 

Information on the clinical relevance of targets and ligands 
in IUPHAR-DB has been enhanced by the addition of data 
on the status and nomenclature of approved drugs. By 
matching structures to entries in DrugBank, 300 small 
molecule drugs with United States Food and Drug 
Administration (FDA) approval for clinical use in 
humans, and 6 drugs now withdrawn from clinical use, 
were identified and marked as 'approved' or 'withdrawn', 
respectively. DrugBank includes a range of useful data on 



clinically used drugs such as indication, mechanism of 
action, absorption, metabolism, toxicity, drug-drug and 
drug-target interactions. Links are provided to 
DrugBank target pages and DrugBank chemical classifica- 
tions and drug page links provided for ligands. Structure- 
based searching also enabled us to assign International 
Nonproprietary Names (INNs) to 420 of the ligands in 
IUPHAR-DB. INNs are the official non-proprietary or 
generic names given to pharmaceutical substances, as 
designated by the World Health Organization (WHO). 

A collaboration with Orphanet, the portal for rare 
diseases and orphan drugs (21), has led to the addition 
of data on rare diseases which are associated with 
targets in IUPHAR-DB. Each rare disease listing in 
IUPHAR-DB includes a brief description and links to 
expert-curated summaries in OMIM and Orphanet. 

Radiolabeled analogues 

Information on assays involving radiolabeled ligands can 
be difficult to find in literature as labelled forms are not 
often mentioned in abstracts. IUPHAR-DB ligand pages 
now feature a quick way to identify radiolabeled forms 
described in the database via a new 'Radio Analogues' tab. 
Labelled ligands are linked to relevant literature refer- 
ences, accessible via a 'References' tab on ligand pages. 
This feature provides a useful complement to a longer 
list of structurally similar ligands that may be found 
under the 'Similar Ligands' tab along with links to their 
targets. 

Enzymes as drug targets 

As a pilot study for the future curation of enzymes as drug 
targets in IUPHAR-DB, we have curated data for the 10 
enzymes of the lanosterol biosynthesis pathway, including 
HMG-CoA reductase, the rate-limiting enzyme in choles- 
terol synthesis and the target of the statin drugs used to 
treat hypercholesterolemia. Information provided about 
enzymes includes structural and genomic data, catalysed 
reactions, Enzyme Commission (EC) numbers, kinetic 
data, substrates, cofactors, selected inhibitors and their 
affinities. Links are provided to other databases including 
BRENDA, the Comprehensive Enzyme Information 
System (22), the International Union of Biochemistry 
and Molecular Biology (IUBMB) Enzyme Nomenclature 
Database (http : / /www .chem.qmul .ac.uk /iubmb /enzyme /) 
and KEGG. The information in IUPHAR-DB comple- 
ments that available in enzyme databases such as 
BRENDA, which organizes enzymes by EC number 
across the tree of life, and bioactivity databases such as 
ChEMBL, which catalogues the results of published me- 
dicinal chemistry screens. The IUPHAR-DB approach is 
focussed on curation of data relevant to individual 
enzymes as human drug targets. 

New database links 

New database links have been provided from target pages 
to Ensembl, a genomic database, DrugBank, a database of 
drugs and their targets, Orphanet, the portal for rare 
diseases and orphan drugs, InterPro (23), a protein signa- 
ture database, ChEMBL, a database of bioactive drug-like 
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small molecules from medicinal chemistry literature, and 
TreeFam (24), a database of phylogenetic trees. Ligands 
are now linked to DrugBank, ChEMBL, ChemSpider, the 
free chemical database from the Royal Society of 
Chemistry, BindingDB (25), a database of measured 
binding affinities, ZINC (26), a database of commercially 
available compounds for virtual screening, and the 
Human Metabolome Database (27), a database of small 
molecule metabolites in humans. Additional new database 
links for endogenous peptide ligands are described above 
in the section on 'Enhanced peptide information'. 

Links with Guide to PHARMACOLOGY 

NC-IUPHAR has established a collaboration with the 
editors and publishers of the British Pharmacological 
Society (BPS) Guide to Receptors and Channels 
(GRAC) (28) to make IUPHAR-DB data available 
through a new portal, Guide to PHARMACOLOGY 
(http://www.guidetopharmacology.org). The Guide to 
PHARMACOLOGY includes a new open-access 
database of information from GRAC and IUPHAR-DB 
with search tools and guidelines for pharmacology. 
GRAC covers a wider range of targets than 
IUPHAR-DB, and for each target lists key selective 
ligands and probes, recommended literature references 
and expert summaries of their properties. The 
IUPHAR-DB website includes links to Guide to 
PHARMACOLOGY pages and ligand pages include a 
summary of the data in GRAC. 

FUTURE PERSPECTIVES 

Work is currently under way to curate structural informa- 
tion for the last remaining group of ligands in 
IUPHAR-DB that currently lack this information: the 
696 peptides not endogenous to human, rat and mouse 
species, which includes toxins, synthetic, radiolabeled 
and modified peptides. In the near future, we plan to 
add a sequence-based search facility for peptides to com- 
plement existing structure-based search tools for small 
molecules. The range of targets will be expanded to en- 
compass all the human targets of current licensed drugs 
and those of probable future interest, with initial effort 
towards curation of enzymes and receptor tyrosine 
kinases. NC-IUPHAR and the BPS are also working 
towards better integration between IUPHAR-DB and 
GRAC, ensuring consistency of information but retaining 
their distinctive focus, with GRAC providing a summary 
of the key properties and ligands for a wide range of 
targets and IUPHAR-DB providing detailed annotation 
for a narrower range of important targets. 

DATA ACCESS 

IUPHAR-DB is freely available to access online at http:// 
www.iuphar-db.org. The website includes downloadable 
files containing current receptor and channel lists, 
NC-IUPHAR nomenclature, synonyms, genetic informa- 
tion, HGNC gene nomenclature and identifiers and other 
database accessions. Other file formats are available by 



emailing curators@iuphar-db.org. Information on 
linking to IUPHAR-DB pages is provided at http:// 
www.iuphar-db.org/linking.jsp. In the near future, we 
plan to enable computational access to the database via 
Web services and to extend the range of options for data 
download via the website. 



CITING IUPHAR-DB 

For a general citation of the resource we recommend 
citing this article. For citing specific target pages we 
recommend a format similar to the following example: 
A. P. Davenport, E. J. Mead. Kisspeptin receptor. Last 
modified on <date>. Accessed on <date>. IUPHAR 
database (IUPHAR-DB), http://www.iuphar-db 
.org/DATABASE/FamilyMenuForward?familyId = 34. 
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